Summary

A two-step semidirect procedure is developed to accelerate
the one-step procedure described in NASA TP-2529. For a
set of constant coefficient model problems, the acceleration
factor increases from 1 to 2 as the one-step procedure
convergence rate decreases from $+\infty$ to 0. It is also shown
numerically that the two-step procedure can substantially
accelerate the convergence of the numerical solution of many
partial differential equations (PDE's) with variable
coefficients.


Introduction 

The iterative procedure developed in reference 1 requires
only a single application of a fast direct solver (FDS) in
advancing $u^n$ to $u^{n+1}$. In the present report the following
two-step iterative procedure is investigated:


$$P w^n = -\tau\,(Q u^n - h) \qquad (\tau \neq 0)$$   (1)

and 

$$P(u^{n+1} - u^n) = R w^n$$   (2)

With the understanding that the notations defined in reference
1 are used in the present report, the only parameters in
equations (1) and (2) yet to be defined are $w^n$, the
intermediate iterative variable, and $R$, an elliptic linear operator
whose exact form will be chosen to accelerate the convergence.
With the assumptions that (1) $u^n \to u$ and $w^n \to w$ as
$n \to +\infty$, and (2) the inverse of $R$ exists, it may be concluded
that $w = 0$ and $u$ is a solution of equation (1-1). (Here, as in
the following, the equation number (1-1), for example, refers
to eq. (1) of ref. 1.)
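
The structure of equations (1) and (2) can be sketched with small dense matrices. This is only an illustration under stated assumptions: `numpy.linalg.solve` stands in for the fast direct solver, and the 2 x 2 matrices `P`, `Q`, `R` and the right side `h` are hypothetical values chosen so that $P^{-1} R P^{-1} Q$ is the identity, in which case a single sweep with $\tau = 1$ already solves $Qu = h$.

```python
import numpy as np

def two_step_iterate(P, Q, R, h, tau, u0, n_iter):
    """Apply n_iter sweeps of equations (1) and (2) to dense matrices."""
    u = np.array(u0, dtype=float)
    for _ in range(n_iter):
        # Equation (1): solve P w = -tau (Q u - h) for the intermediate variable w.
        w = np.linalg.solve(P, -tau * (Q @ u - h))
        # Equation (2): solve P (u_new - u) = R w for the correction to u.
        u = u + np.linalg.solve(P, R @ w)
    return u

# Hypothetical data (not from the report): P = I and R = Q^{-1}.
P = np.eye(2)
Q = np.diag([1.0, 2.0])
R = np.diag([1.0, 0.5])
h = np.array([3.0, 4.0])
u1 = two_step_iterate(P, Q, R, h, tau=1.0, u0=np.zeros(2), n_iter=1)
```

Each sweep costs two solves with `P`, which is the factor of two in computational effort discussed next.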

If one assumes that the computational effort required to
advance $u^n$ to $u^{n+1}$ with the two-step procedure is twice that
required with the one-step procedure, the former has no merit
unless it can achieve a convergence rate at least twice that of
the latter. Fortunately, for a large class of operators $Q$ an
operator $R$ can be found such that the two-step method
convergence rate is 2 to 4 times the one-step method
convergence rate. Moreover, as will be shown, the higher
acceleration factor is realized whenever the one-step method
convergence rate is lower.


In the section Analysis the convergence of the two-step 
procedure is investigated in a fashion similar to that presented 
in the section Analysis of reference 1. In the section Local 
Relaxation the results obtained in the section Analysis are 
extended to solve partial differential equations (PDE’s) with 
variable coefficients. Finally, in the section Numerical 
Evaluation the two-step method is numerically compared with 
the one-step method by using two-dimensional (2-D) and three- 
dimensional (3-D) test problems. 


Analysis 

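In this section the two-step procedure defined in equations
(1) and (2) is studied by assuming that $\tau$ is a constant and

$$R \overset{\text{def}}{=} a'\frac{\partial^2}{\partial x^2} + 2b'\frac{\partial^2}{\partial x\,\partial y} + c'\frac{\partial^2}{\partial y^2}$$   (3)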


where $a'$, $b'$, and $c'$ are constant coefficients to be determined
later. Furthermore it is assumed that the operators $Q$ and $P$
are those defined in equations (1-3) to (1-5). The central
difference forms of equations (1) and (2) can be expressed as

$$P(w^n_{i,j}) = -\tau\,[Q(u^n_{i,j}) - h_{i,j}]$$   (4)

and

$$P(u^{n+1}_{i,j} - u^n_{i,j}) = R(w^n_{i,j})$$   (5)


respectively. Here $Q$ and $P$, respectively, are defined in
equations (1-8) and (1-9), and


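$$R(v_{i,j}) \overset{\text{def}}{=} a'(\Delta x)^{-2}(v_{i+1,j} + v_{i-1,j} - 2v_{i,j}) + c'(\Delta y)^{-2}(v_{i,j+1} + v_{i,j-1} - 2v_{i,j}) + b'(2\Delta x\,\Delta y)^{-1}(v_{i+1,j+1} + v_{i-1,j-1} - v_{i+1,j-1} - v_{i-1,j+1})$$   (6)

for any $v_{i,j}$.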
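
As an illustration of equation (6), the following sketch applies the stencil on a doubly periodic grid and compares the result with the Fourier symbol obtained by substituting a single mode $\exp[2\pi i(ki/K + \ell j/L)]$ into the stencil. The function names and the periodic wrap-around via `numpy.roll` are assumptions made for this example; the symbol is derived here directly from equation (6) and is not claimed to be in the normalization of equation (12).

```python
import numpy as np

def apply_R(v, a1, b1, c1, dx, dy):
    """Equation (6) on a doubly periodic grid; axis 0 is i (x), axis 1 is j (y)."""
    def sh(di, dj):
        # sh(di, dj)[i, j] == v[i + di, j + dj] with periodic wrap-around.
        return np.roll(v, shift=(-di, -dj), axis=(0, 1))
    xx = (sh(1, 0) + sh(-1, 0) - 2.0 * v) / dx**2
    yy = (sh(0, 1) + sh(0, -1) - 2.0 * v) / dy**2
    xy = (sh(1, 1) + sh(-1, -1) - sh(1, -1) - sh(-1, 1)) / (2.0 * dx * dy)
    return a1 * xx + c1 * yy + b1 * xy

def stencil_symbol(k, l, K, L, a1, b1, c1, dx, dy):
    """Eigenvalue of the stencil for the Fourier mode (k, l)."""
    tk, tl = 2.0 * np.pi * k / K, 2.0 * np.pi * l / L
    return (2.0 * a1 * (np.cos(tk) - 1.0) / dx**2
            + 2.0 * c1 * (np.cos(tl) - 1.0) / dy**2
            - 2.0 * b1 * np.sin(tk) * np.sin(tl) / (dx * dy))
```

Applying `apply_R` to a single Fourier mode returns that mode multiplied by `stencil_symbol`, which is the property used when the error is expanded in Fourier modes.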


This report has been reprinted so that the graphs of figures 1, 2, 4, and 5 are in the correct locations. 
To study the convergence rate of the iterative procedure
defined by equations (4) and (5), one notes that equations (4),
(5), and (1-6) imply that

$$P(w^n_{i,j}) = -\tau\,Q(e^n_{i,j})$$   (7)

and

$$P(e^{n+1}_{i,j} - e^n_{i,j}) = R(w^n_{i,j})$$   (8)

where $e^n_{i,j}$ is defined in equation (1-11). With the assumption
that both the $w^n_{i,j}$'s and the $e^n_{i,j}$'s satisfy the periodic and
uniqueness conditions as given in equations (1-12) to (1-14), a line of
arguments similar to that presented in appendix A of reference
1 can be used to show that the unique solution $e^n_{i,j}$ to equations
(7) and (8) is



(£=0,1,2,..., (AT— 1); f=0,l,2,,.., 


(*,*)€* 

0 n = 1,2,3,...; ij = 0, ± 1, ± 2,...) (9) 

where 

$$\bar{\bar G}^{(k,\ell)}(\tau) \overset{\text{def}}{=} 1 - \tau\,\gamma^{(k,\ell)}\,\gamma'^{(k,\ell)} \qquad (k,\ell) \in \Psi$$   (10)

The only parameter in equations (9) and (10) yet to be defined 
is 

7 = f (U) 6 * (id 

where 


A*\ /rfO 

cos — cos — ) 

W \lJ j 

-D) (12) 


Note that here, as in the following, the symbol = placed over a
parameter (written as a double overbar, for example $\bar{\bar G}$) is
used to designate a parameter for the two-step procedure in case this
parameter is different from its counterpart in the one-step
procedure.

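At this juncture note that equation (9) is identical to equation
(1-21) except that $G^{(k,\ell)}(\tau)$ in equation (1-21) is replaced by
$\bar{\bar G}^{(k,\ell)}(\tau)$ in equation (9). Thus for the two-step procedure the
counterpart of equation (1-26) is

$$\bar{\bar M}^{\infty} = \bar{\bar G}(\tau)$$   (13)

where

$$\bar{\bar G}(\tau) \overset{\text{def}}{=} \underset{(k,\ell)\in\Psi}{\text{Max}}\ \left|\bar{\bar G}^{(k,\ell)}(\tau)\right|$$   (14)

Furthermore the only difference between $G^{(k,\ell)}(\tau)$ and
$\bar{\bar G}^{(k,\ell)}(\tau)$ is that the parameter $\gamma^{(k,\ell)}$
in $G^{(k,\ell)}(\tau)$ is replaced by the parameter

$$\bar{\bar\gamma}^{(k,\ell)} \overset{\text{def}}{=} \gamma^{(k,\ell)} \cdot \gamma'^{(k,\ell)} \qquad (k,\ell) \in \Psi$$   (15)

in $\bar{\bar G}^{(k,\ell)}(\tau)$.

Let

$$a' > 0, \qquad c' > 0, \qquad a'c' - (b')^2 > 0$$   (16)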
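and

$$\tilde a' \overset{\text{def}}{=} \frac{a'}{a_0}, \qquad \tilde c' \overset{\text{def}}{=} \frac{c'}{c_0}, \qquad \tilde b' \overset{\text{def}}{=} \frac{b'}{\sqrt{a_0 c_0}}$$   (17)

A line of arguments which leads to equation (I-B18) can be
used to show that

$$\lambda'_{\max} \ge \gamma'^{(k,\ell)} \ge \lambda'_{\min} > 0 \qquad (k,\ell) \in \Psi$$   (18)

where

$$\lambda'_{\max} \overset{\text{def}}{=} \tfrac{1}{2}\left[\tilde a' + \tilde c' + \sqrt{(\tilde a' - \tilde c')^2 + 4(\tilde b')^2}\right]$$   (19)

and

$$\lambda'_{\min} \overset{\text{def}}{=} \tfrac{1}{2}\left[\tilde a' + \tilde c' - \sqrt{(\tilde a' - \tilde c')^2 + 4(\tilde b')^2}\right]$$   (20)

Let

$$\bar{\bar\gamma}_{\max} \overset{\text{def}}{=} \underset{(k,\ell)\in\Psi}{\text{Max}}\ \{\bar{\bar\gamma}^{(k,\ell)}\} \qquad \text{and} \qquad \bar{\bar\gamma}_{\min} \overset{\text{def}}{=} \underset{(k,\ell)\in\Psi}{\text{Min}}\ \{\bar{\bar\gamma}^{(k,\ell)}\}$$   (21)

then equations (18) and (I-B18) imply that $\bar{\bar\gamma}_{\max} \ge \bar{\bar\gamma}_{\min} > 0$.
As a result one may conclude that (see equations (1-27) to
(1-29)) $\bar{\bar G}(\tau)$ reaches its minimum

$$\bar{\bar G}^{\circ} \overset{\text{def}}{=} \bar{\bar G}(\bar{\bar\tau}^{\circ}) = \frac{\bar{\bar E} - 1}{\bar{\bar E} + 1} < 1$$   (22)

when

$$\tau = \bar{\bar\tau}^{\circ} \overset{\text{def}}{=} \frac{2}{\bar{\bar\gamma}_{\max} + \bar{\bar\gamma}_{\min}}$$   (23)

Here $\bar{\bar\tau}^{\circ}$ is the optimal relaxation factor for the two-step
method, and

$$\bar{\bar E} \overset{\text{def}}{=} \frac{\bar{\bar\gamma}_{\max}}{\bar{\bar\gamma}_{\min}} \ge 1$$   (24)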

With the assumption $\tau = \bar{\bar\tau}^{\circ}$, it can be concluded from
equations (13) and (22) that (1) $\bar{\bar M}^{\infty} < 1$ and (2) $\bar{\bar M}^{\infty}$
increases with an increase of $\bar{\bar E}$.
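
The role of equations (22) to (24) can be checked numerically with a short sketch. It assumes that the two-step eigenvalue ratios are available as a plain array (`gamma_hat`, a hypothetical name) and that the error factor of each mode is $1 - \tau\,\bar{\bar\gamma}^{(k,\ell)}$ as in equation (10); the sample values are arbitrary.

```python
import numpy as np

def optimal_relaxation(gamma_hat):
    """Return (tau_opt, G_opt, E) following equations (23), (22), and (24)."""
    g_max = float(np.max(gamma_hat))
    g_min = float(np.min(gamma_hat))
    E = g_max / g_min                  # equation (24)
    tau_opt = 2.0 / (g_max + g_min)    # equation (23)
    G_opt = (E - 1.0) / (E + 1.0)      # equation (22)
    return tau_opt, G_opt, E

def error_factor(gamma_hat, tau):
    """Largest |1 - tau * gamma| over all modes (equations (10) and (14))."""
    return float(np.max(np.abs(1.0 - tau * np.asarray(gamma_hat, dtype=float))))

# Arbitrary sample ratios, not taken from the report.
sample = [1.0, 1.2, 1.5, 2.0]
tau_opt, G_opt, E = optimal_relaxation(sample)
```

For these sample values the extreme modes are damped equally at `tau_opt`, and moving the relaxation factor in either direction raises `error_factor`.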

If the coefficients $a$, $b$, $c$, $a_0$, and $c_0$; the aspect ratio
($\Delta y/\Delta x$); and the integers $K$ and $L$ are known, the parameter
$\bar{\bar E}$ is a function of the coefficients $a'$, $b'$, and $c'$. Since the
convergence rate increases with a decrease of $\bar{\bar E}$, ideally the
coefficients $a'$, $b'$, and $c'$ should be chosen such that $\bar{\bar E}$ is
at its minimum. Unfortunately this optimization problem is
too complicated to be solved by a simple analytical technique;
thus, a simplified version of this optimization problem is
solved.

To proceed, one notes that $\bar{\bar E}$ can be considered as a function
of $K$ and $L$ if the parameters $a$, $b$, $c$, $a'$, $b'$, $c'$, $a_0$, $c_0$, and
($\Delta y/\Delta x$) are known. In the appendix the existence of

$$\bar{\bar E}^{\infty} \overset{\text{def}}{=} \underset{K,\,L}{\text{Sup}}\ \{\bar{\bar E}\}$$   (25)

is established for any given set of $a$, $b$, $c$, $a'$, $b'$, $c'$, $a_0$, $c_0$,
and ($\Delta y/\Delta x$). Note that in this report and reference 1
supremum and infimum of a finite set are denoted by Max
and Min, respectively. On the other hand, supremum and
infimum of an infinite set are denoted by Sup and Inf,
respectively. Let


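$$\bar{\bar E}^{*} \overset{\text{def}}{=} \frac{(\lambda_{\max} + \lambda_{\min})^2}{4\lambda_{\max}\lambda_{\min}} = \frac{(\tilde a + \tilde c)^2}{4(\tilde a\tilde c - \tilde b^2)}$$   (26)

where $\lambda_{\max}$, $\lambda_{\min}$, $\tilde a$, $\tilde b$, and $\tilde c$ are defined in equations (1-30)
to (1-32). It is shown in the appendix that (1) $\bar{\bar E}^{\infty} \ge \bar{\bar E}^{*}$ and
(2) $\bar{\bar E}^{\infty} = \bar{\bar E}^{*}$ if and only if

$$D D' = \Lambda I \qquad (\Lambda > 0)$$   (27)

where $I$ is the 2 x 2 identity matrix,

$$D \overset{\text{def}}{=} \begin{pmatrix} \tilde a & \tilde b \\ \tilde b & \tilde c \end{pmatrix}, \qquad D' \overset{\text{def}}{=} \begin{pmatrix} \tilde a' & \tilde b' \\ \tilde b' & \tilde c' \end{pmatrix}$$   (28)

and $\Lambda$ an arbitrary positive scalar. In this report the values
of $a'$, $b'$, and $c'$ are determined by using equation (27) with
the assumption $\Lambda = 1$. Thus,

$$a' = \frac{a_0\,\tilde c}{\tilde a\tilde c - \tilde b^2}, \qquad c' = \frac{c_0\,\tilde a}{\tilde a\tilde c - \tilde b^2}, \qquad b' = -\frac{\sqrt{a_0 c_0}\;\tilde b}{\tilde a\tilde c - \tilde b^2}$$   (29)

Equation (29) obviously is consistent with equations (1-4) and
(16).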
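
A minimal sketch of how the coefficients of equation (29) follow from condition (27) with $\Lambda = 1$ is given below. It assumes the scalings $\tilde a = a/a_0$, $\tilde c = c/c_0$, $\tilde b = b/\sqrt{a_0 c_0}$ and the corresponding primed scalings of equation (17); the function name and the sample inputs are hypothetical.

```python
import numpy as np

def two_step_coefficients(a, b, c, a0, c0):
    """Return (a', b', c', E*) from condition (27) with Lambda = 1."""
    a_t, c_t, b_t = a / a0, c / c0, b / np.sqrt(a0 * c0)
    D = np.array([[a_t, b_t], [b_t, c_t]])
    D_prime = np.linalg.inv(D)                  # D D' = I, equation (27)
    a1 = a0 * D_prime[0, 0]                     # undo the scaling of equation (17)
    c1 = c0 * D_prime[1, 1]
    b1 = np.sqrt(a0 * c0) * D_prime[0, 1]
    E_star = (a_t + c_t) ** 2 / (4.0 * (a_t * c_t - b_t ** 2))   # equation (26)
    return a1, b1, c1, E_star

# Sample inputs chosen for illustration only.
a1, b1, c1, E_star = two_step_coefficients(a=2.0, b=0.5, c=3.0, a0=1.0, c0=2.0)
```

The returned values satisfy the positivity conditions (16) whenever the scaled matrix $D$ is positive definite, since the inverse of a positive definite matrix is positive definite.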

With the coefficients $a'$, $b'$, and $c'$ specified according to
equation (29) it is shown in the appendix that

$$\bar{\bar E}^{*} \ge \bar{\bar\gamma}_{\max} \ge \bar{\bar\gamma}_{\min} \ge 1$$   (30)

and

$$\lim_{K,L\to+\infty} \bar{\bar\gamma}_{\max} = \bar{\bar E}^{*}, \qquad \lim_{K,L\to+\infty} \bar{\bar\gamma}_{\min} = 1$$   (31)

Thus, in the limit of $K$ and $L \to +\infty$ the parameters $\bar{\bar\tau}^{\circ}$ and
$\bar{\bar G}^{\circ}$, respectively, approach

$$\bar{\bar\tau}^{*} \overset{\text{def}}{=} \frac{2}{\bar{\bar E}^{*} + 1}$$   (32)

and

$$\bar{\bar G}^{*} \overset{\text{def}}{=} \frac{\bar{\bar E}^{*} - 1}{\bar{\bar E}^{*} + 1} < 1$$   (33)

If equations (33), (26), (1-36), and (1-37) are used, it can be
shown that

$$\bar{\bar G}^{*} = \rho(G^{*}) \overset{\text{def}}{=} \frac{(G^{*})^2}{2 - (G^{*})^2}$$   (34)

Since

$$\frac{d\rho}{dG^{*}} = \frac{4G^{*}}{[2 - (G^{*})^2]^2} > 0$$   (35)

for $1 > G^{*} > 0$, one concludes that $\rho$ is a simple
monotonically increasing function of $G^{*}$. Thus, the same
value of $c_0/a_0$ which optimizes $G^{*}$ (see equations (1-39) and
(1-40)) will also optimize $\bar{\bar G}^{*}$. In other words the value of $\bar{\bar G}^{*}$
reaches its minimum

$$\bar{\bar G}^{*}_{\min} \overset{\text{def}}{=} \frac{(G^{*}_{\min})^2}{2 - (G^{*}_{\min})^2} = \frac{b^2}{2ac - b^2}$$   (36)

when $c_0/a_0 = c/a$.

In the limit of $K$ and $L \to +\infty$ the parameters $G^{*}$ and $\bar{\bar G}^{*}$
are the asymptotic error multiplication factors for the one- and
two-step methods, respectively. Assuming that the execution
of one iteration in the two-step procedure requires twice as
much computational time as that in the one-step procedure,
one may conclude that asymptotically the two-step method is
faster than the one-step method by a factor of $\xi$ if

$$\bar{\bar G}^{*} = (G^{*})^{2\xi} \qquad (1 > G^{*} > 0)$$

With the aid of equation (34) one obtains

$$\xi = \xi(G^{*}) \overset{\text{def}}{=} 1 - \frac{\ln[2 - (G^{*})^2]}{\ln[(G^{*})^2]} \qquad (1 > G^{*} > 0)$$   (37)

Since

$$\frac{d\xi}{dx} > 0 \qquad (1 > x > 0)$$   (38)

and

$$\lim_{x\to 0^{+}} \xi(x) = 1, \qquad \lim_{x\to 1^{-}} \xi(x) = 2$$   (39)

the acceleration factor $\xi$ increases from 1 to 2 monotonically
as the value of $G^{*}$ increases from 0 to 1; that is, the higher
value of $\xi$ is realized whenever the need to accelerate the
one-step method is greater. (See also fig. 1.)

In a fashion similar to that described in the section Analysis
of reference 1, the two-step procedure described in this section
can be generalized for a space of higher dimension. However,
it should be noted that in an $N$-dimensional space ($N > 2$) the
generalized version of equation (27) represents only a subset
of the general solution to the condition $\bar{\bar E}^{*} = \bar{\bar E}^{\infty}$ (ref. 2).
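
The relation between the one- and two-step asymptotic factors in equation (34) and the acceleration factor of equation (37) can be evaluated directly. The sketch below uses only those two formulas; the function names `rho` and `xi` are chosen here for illustration.

```python
import math

def rho(G):
    """Two-step asymptotic factor as a function of the one-step factor, equation (34)."""
    return G ** 2 / (2.0 - G ** 2)

def xi(G):
    """Asymptotic acceleration factor, equation (37), for 0 < G < 1."""
    return 1.0 - math.log(2.0 - G ** 2) / math.log(G ** 2)

# One-step factors spanning the open interval (0, 1).
samples = [0.01, 0.25, 0.5, 0.875, 0.99]
factors = [xi(G) for G in samples]
```

By construction `rho(G)` equals `G ** (2 * xi(G))`, so one two-step iteration reduces the error as much as `2 * xi(G)` one-step iterations, and `factors` increases toward 2 as `G` approaches 1.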


Figure 1. Function $\xi(x) = 1 - [\ln(2 - x^2)/\ln(x^2)]$.


Local Relaxation

With the technique of local relaxation developed in the
section Local Relaxation of reference 1, the two-step procedure
described in the section Analysis can be extended to solve
PDE's with variable coefficients (VC's). Thus, with the aid
of equations (26) and (32) one concludes that equations (1-47)
and (1-58) are replaced by

$$\bar{\bar\tau}_{i,j} = 2\left\{\frac{(\tilde a_{i,j} + \tilde c_{i,j})^2}{4[\tilde a_{i,j}\tilde c_{i,j} - (\tilde b_{i,j})^2]} + 1\right\}^{-1}$$   (40)

and

$$\bar{\bar\tau}_{i,j} = 2\left\{\frac{(\tilde p_{i,j} + \tilde q_{i,j})^2}{4\,\tilde p_{i,j}\,\tilde q_{i,j}} + 1\right\}^{-1}$$   (41)

in the two-step procedure.

Moreover, the VC versions of equations (6) and (29) are
obtained by substituting the coefficients $a'$, $b'$, $c'$, $\tilde a$, $\tilde b$,
and $\tilde c$ with $a'_{i,j}$, $b'_{i,j}$, $c'_{i,j}$, $\tilde a_{i,j}$, $\tilde b_{i,j}$, and $\tilde c_{i,j}$. For the case when
the operator $Q$ is a self-adjoint operator defined in equation
(1-56), these VC versions are modified by replacing $\tilde a_{i,j}$, $\tilde c_{i,j}$,
and $\tilde b_{i,j}$ with $\tilde p_{i,j}$, $\tilde q_{i,j}$, and 0. It should be cautioned that for the
special case where $\tilde p_{i,j} = \tilde q_{i,j}$

[equation illegible in the source]

Thus, the VC versions of equations (4) and (5) imply that

[equation illegible in the source]

In other words the extra computational effort required for the
two-step method is completely wasted, since the convergence
achieved in one iteration is identical for both the one- and
two-step methods. Note that a similar situation also arises in the
solution of the PDE's with constant coefficients. Let $a = c$
and $b = 0$. Then $G^{*} = \bar{\bar G}^{*} = 0$. In other words the machine
accuracy solution is obtained in one iteration for both the one-
and two-step methods. Since the parameter $\xi$ is ill defined at
$G^{*} = 0$ (eq. (37)), the assertion made in the section Analysis
concerning the advantage of using the two-step method
apparently is not valid for the special case in which $a = c$ and
$b = 0$.

Finally, it is noted that equations (34), (35), and (1-52) to
(1-55) can be used to show that

(1) $$\bar{\bar M}^{\infty} \le \bar{\bar G}^{\infty} = \rho(G^{\infty})$$   (42)

(2) For the case in which $b_{i,j} = 0$ for all $(i,j) \in \Phi$, $\bar{\bar G}^{\infty}$
reaches its minimum

$$\bar{\bar G}^{\infty}_{\min} \overset{\text{def}}{=} \rho(G^{\infty}_{\min})$$   (43)

if and only if $c_0/a_0 = \sqrt{\beta_{\max}\,\beta_{\min}}$. Equation (42) combined
with equations (34) and (37) suggests that the parameter

$$\xi(G^{\infty}) \overset{\text{def}}{=} 1 - \frac{\ln[2 - (G^{\infty})^2]}{\ln[(G^{\infty})^2]}$$   (44)

may be used to predict the numerical acceleration factor

$$\xi_r(n) \overset{\text{def}}{=} \frac{[O_r(n)]_{\text{two-step procedure}}}{[O_r(2n)]_{\text{one-step procedure}}}$$   (45)

where $O_r(n)$ is defined in equation (1-61).

Numerical Evaluation 
Initially the test problems used in the comparison of the one-
and two-step methods involve the constant coefficient finite
difference equations defined by equations (1-6) and (1-8). These
problems were designed such that many key results of the
current theoretical development could be tested numerically
under the most ideal conditions. As a preliminary a line of
arguments similar to that given in appendix A of reference
1 is used to obtain the following results:

(1) Let

$$h_{i,j} = h_{i+K,j} = h_{i,j+L} \qquad (i,j = 0, \pm 1, \pm 2, \ldots)$$   (46)

and

$$\sum_{i=0}^{K-1}\sum_{j=0}^{L-1} h_{i,j} = 0$$   (47)

where $K \ge 2$ and $L \ge 2$ are two arbitrary integers. Then the $u_{i,j}$'s
are uniquely determined by equation (1-6) and the following
auxiliary conditions:

$$u_{i,j} = u_{i+K,j} = u_{i,j+L} \qquad (i,j = 0, \pm 1, \pm 2, \ldots)$$   (48)

and

$$\sum_{i=0}^{K-1}\sum_{j=0}^{L-1} u_{i,j} = 0$$   (49)

(2) The unique solution to equations (1-6), (48), and (49)
is explicitly given by

[equation (50) illegible in the source] $\qquad (i,j = 0, \pm 1, \pm 2, \ldots)$   (50)

The only parameter in equation (50) which was not defined
previously is

[equation (51) illegible in the source] $\qquad (k,\ell) \in \Psi$   (51)

where $\bar\phi^{(k,\ell)}_{i,j}$ is the complex conjugate of $\phi^{(k,\ell)}_{i,j}$.

For the test problems defined in table I the $h_{i,j}$'s are chosen
to meet the conditions (46) and (47). They are given by either

[equation (52) illegible in the source]   (52)

or

$$h_{i,j} = \sum_{(k,\ell)\in\Psi} \cos\left(\frac{2\pi k\, i}{K}\right)\cos\left(\frac{2\pi \ell\, j}{L}\right)$$   (53)

The $u^n_{i,j}$'s and $w^n_{i,j}$'s are assumed to satisfy the periodic and
uniqueness conditions given in equations (48) and (49).
Moreover, it is assumed that (1) $a = c = a_0 = c_0 = 1$, (2)
$\Delta x = \Delta y = 1/K = 1/L$, and (3) the relaxation factors $\tau^{*}$ and
$\bar{\bar\tau}^{*}$, respectively, are used in the one- and two-step
procedures. Note that equations (1-38) and (1-41) coupled with
assumption (1) yield

$$\tau^{*} = \tau^{\circ} = 1$$

Problems 1 to 4 are studied first. Since $u^0_{i,j} = 0$ for every
$i$ and $j$, equations (50) to (52) can be used to show that the
only surviving $E^{0,(k,\ell)}$'s are $E^{0,(1,1)}$, $E^{0,(1,L-1)}$, $E^{0,(K-1,1)}$, and
$E^{0,(K-1,L-1)}$. Moreover, since $K = L$,

$$\gamma^{(1,1)} = \gamma^{(K-1,L-1)} = \begin{cases} \gamma_{\max} & \text{if } b > 0 \\ \gamma_{\min} & \text{if } b < 0 \end{cases}$$   (54)

and

TABLE I. DEFINITIONS OF PROBLEMS 1 TO 8 AND VALUES OF KEY PARAMETERS

[$a = c = a_0 = c_0 = 1$, $\Delta x = \Delta y = 1/K = 1/L$.]

Columns: Problem; $h_{i,j}$; $b$; $K = L$; $-\log_{10}(G^{*})$; $-\log_{10}(G^{\circ})$; $-\log_{10}(\bar{\bar G}^{*})$; $-\log_{10}(1 - \bar{\bar\tau}^{*}\bar{\bar\gamma}_{\min})$; $\xi(G^{*})$; $\xi'$

Problem  h_ij      b      K=L  -log10(G*)  -log10(G°)  -log10(G=*)  -log10(1-τγmin)  ξ(G*)     ξ'
1        eq. (52)  0.25   16   0.6020600   0.6189121   1.491362     1.561608         1.238549  1.261575
2        eq. (52)  0.25   64   0.6020600   0.6031069   1.491362     1.495559         1.238549  1.239879
3        eq. (52)  0.875  16   0.05799195  0.07484407  0.2074310    0.2776769        1.788447  1.855036
4        eq. (52)  0.875  64   0.05799195  0.05903883  0.2074310    0.2116287        1.788447  1.792284
5        eq. (53)  0.25   16   0.6020600   0.6189121   1.491362     1.561608         1.238549  1.261575
6        eq. (53)  0.25   64   0.6020600   0.6031069   1.491362     1.495559         1.238549  1.239879
7        eq. (53)  0.875  16   0.05799195  0.07484407  0.2074310    0.2776769        1.788447  1.855036
8        eq. (53)  0.875  64   0.05799195  0.05903883  0.2074310    0.2116287        1.788447  1.792284
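
The tabulated values can be reproduced with a short sketch. The closed forms for the grid-dependent quantities are assumptions of this example, not formulas quoted from the report: for $a = c = a_0 = c_0 = 1$ and $\Delta x = \Delta y$ the $(1,1)$ mode is taken to give $\gamma^{(1,1)} = 1 + b\cos^2(\pi/K)$ and $\bar{\bar\gamma}_{\min} = [1 - b^2\cos^4(\pi/K)]/(1 - b^2)$, which follow from the Fourier symbols of the central-difference stencils. The remaining quantities use equations (26), (32), (33), (37), and (59), with the relaxation factors $\tau^{*} = 1$ and $\bar{\bar\tau}^{*}$.

```python
import math

def table1_row(b, K):
    """Return the six tabulated quantities for a = c = a0 = c0 = 1 and K = L."""
    t = math.cos(math.pi / K) ** 2             # assumed coupling factor of the (1,1) mode
    G_star = abs(b)                            # one-step asymptotic factor
    G_opt = abs(b) * t                         # one-step factor on the K x K grid (tau = 1)
    E_hat_star = 1.0 / (1.0 - b * b)           # equation (26) with a~ = c~ = 1, b~ = b
    G_hat_star = (E_hat_star - 1.0) / (E_hat_star + 1.0)    # equation (33)
    tau_hat_star = 2.0 / (E_hat_star + 1.0)                 # equation (32)
    gamma_hat_min = (1.0 - (b * t) ** 2) / (1.0 - b * b)    # assumed (1,1)-mode value
    two_step = 1.0 - tau_hat_star * gamma_hat_min
    xi = 1.0 - math.log(2.0 - G_star ** 2) / math.log(G_star ** 2)   # equation (37)
    xi_prime = math.log10(two_step) / (2.0 * math.log10(G_opt))      # equation (59)
    return (-math.log10(G_star), -math.log10(G_opt), -math.log10(G_hat_star),
            -math.log10(two_step), xi, xi_prime)

row_1 = table1_row(0.25, 16)    # problems 1 and 5
row_4 = table1_row(0.875, 64)   # problems 4 and 8
```

Under these assumptions the computed rows agree with the table to the printed precision, which also illustrates how the grid-dependent columns approach the grid-independent ones as $K$ grows.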
$$\gamma^{(K-1,1)} = \gamma^{(1,L-1)} = \begin{cases} \gamma_{\min} & \text{if } b > 0 \\ \gamma_{\max} & \text{if } b < 0 \end{cases}$$   (55)


These considerations coupled with other equations given 
previously lead to a simple formula for the residual norm of 
the one-step procedure: 


( (K-l) (L-\) r , J 

s E 2(<,) 

C i— 0 7=0 L I 


(^ = L) 


where $G^{\circ}$ is the parameter defined in equation (1-27). Thus,
one concludes that in the absence of roundoff error the relation
between $O_r(n)$ and $n$ for problems 1 to 4 is represented by
straight lines if the one-step method is used. The slope of each
straight line is $-\log_{10}(G^{\circ})$.

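Similarly by using the fact that

$$\bar{\bar\gamma}^{(1,1)} = \bar{\bar\gamma}^{(K-1,1)} = \bar{\bar\gamma}^{(1,L-1)} = \bar{\bar\gamma}^{(K-1,L-1)} = \bar{\bar\gamma}_{\min} \qquad (K = L)$$   (57)

one concludes that for the two-step procedure

[equation (58) missing from the source]

Thus, in the absence of roundoff error the relation between
$O_r(n)$ and $n$ for problems 1 to 4 is also represented by
straight lines if the two-step method is used. The slope of each
straight line is $-\log_{10}(1 - \bar{\bar\tau}^{*}\bar{\bar\gamma}_{\min})$.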

The predictions given by equations (56) and (58) are 
confirmed by the numerical results shown in figures 2 and 3. 
The slopes of the upper two curves in figure 2 and all four 
curves in figure 3 agree with the predicted values for at least 
the first seven significant digits. (The accuracy of the 
computation is double precision on the IBM 370.) This is also 
true for the lower two curves in figure 2 before the roundoff 
error becomes dominant. Furthermore, these two curves 
quickly settle into horizontal lines as soon as the roundoff error 
becomes dominant. 

These numerical results indicate that the roundoff errors
never grow during the entire convergence histories of problems
1 to 4. This is also consistent with our theoretical development.
Recall that a roundoff error introduced at any stage of the
iterations can be considered as a linear combination of
the $\phi^{(k,\ell)}_{i,j}$'s. Because of the uniqueness condition (I-A12), the $k = 0$
and $\ell = 0$ component of this combination is annihilated during
the next iteration. On the other hand, each of the remaining
components is multiplied by either $G^{(k,\ell)}(\tau^{*})$ (one-step
method) or $\bar{\bar G}^{(k,\ell)}(\bar{\bar\tau}^{*})$ (two-step method) as the iteration
number $n$ increases by 1. With the aid of equations (1-33),
(1-35), (30), and (32), it can be shown that

$$|G^{(k,\ell)}(\tau^{*})| < 1 \quad \text{and} \quad |\bar{\bar G}^{(k,\ell)}(\bar{\bar\tau}^{*})| < 1 \qquad (k,\ell) \in \Psi$$

Figure 2. Convergence histories of problems 1 and 2.

Figure 3. Convergence histories of problems 3 and 4.

For problems 1 to 4, the acceleration factor $\xi_r(n)$ defined
in equation (45) is virtually equal to
$$\xi' \overset{\text{def}}{=} \frac{\log_{10}(1 - \bar{\bar\tau}^{*}\bar{\bar\gamma}_{\min})}{2\log_{10}(G^{\circ})}$$   (59)

before the roundoff error becomes dominant. The value of $\xi'$
as well as the values of five other parameters are listed in the
last six columns of table I. These parameters, according to
equations (1-34) and (31), are related by the following limit
equations:

$$\lim_{K,L\to+\infty} \log_{10}(G^{\circ}) = \log_{10}(G^{*})$$   (60)

$$\lim_{K,L\to+\infty} \log_{10}(1 - \bar{\bar\tau}^{*}\bar{\bar\gamma}_{\min}) = \log_{10}(\bar{\bar G}^{*})$$   (61)

and

$$\lim_{K,L\to+\infty} \xi' = \xi(G^{*})$$   (62)

Figure 4. Convergence histories of problems 5 and 6 (one-step and two-step methods).


Note that the parameters on the left sides of equations (60)
to (62) depend on the aspect ratio $\Delta y/\Delta x$ but not on the
individual values of $\Delta x$ and $\Delta y$. Thus, the variation of $\Delta x$ and
$\Delta y$ is allowable as the integers $K$ and $L$ approach infinity as
long as the ratio $\Delta y/\Delta x$ is held constant. Since $\Delta y/\Delta x = 1$ for
problems 1 to 8, one may expect that the values of the
parameters on the left sides of equations (60) to (62) approach
more closely the values of the corresponding
grid-independent parameters on the right sides as the values of $K$
and $L$ increase from $K = L = 16$ to $K = L = 64$. The values
shown in table I confirm this expectation.

As also shown in table I, the values of $-\log_{10}(G^{\circ})$ and
$-\log_{10}(1 - \bar{\bar\tau}^{*}\bar{\bar\gamma}_{\min})$, and thus the actual convergence rates of
the one- and two-step procedures, may be substantially
underestimated by the values of $-\log_{10}(G^{*})$ and $-\log_{10}(\bar{\bar G}^{*})$,
respectively, if $b$ is relatively large, and $K$ and $L$ are relatively
small. As a result the convergence rates tend to be more
sensitive to the change of the integers $K$ and $L$ if the value
of $b/\sqrt{ac}$ is closer to 1. (See fig. 3.)

For problems 5 to 8 the $h_{i,j}$'s are specified according to
equation (53). With this choice of the source term all $E^{0,(k,\ell)}$'s
survive. As a result the relation between $O_r(n)$ and $n$ for
problems 5 to 8 is no longer represented by straight lines.
However, as expected from theoretical considerations, the
slope of any curve shown in figures 4 and 5 approaches either
$-\log_{10}(G^{\circ})$ (one-step procedure) or $-\log_{10}[\bar{\bar G}(\bar{\bar\tau}^{*})]$ (two-step
procedure) before the roundoff error becomes dominant. Note
that $\bar{\bar G}(\bar{\bar\tau}^{*}) = \bar{\bar G}^{*}$ if $a' = c'$.

Figure 5. Convergence histories of problems 7 and 8.

The initial comparisons of the one- and two-step methods
in regard to their ability to solve PDE's with variable
coefficients involve problems 1 to 17 of reference 1. The
relative efficiencies of these two methods, measured in terms
of the parameter $\xi_r(n)$ (eq. (45)), are given in table II. It is
seen that (1) the convergence is accelerated, that is,
$\xi_r(n) > 1$, by the two-step procedure in only six test cases,
and (2) without any exception, $\xi_r(n)$ is smaller than the
corresponding theoretical parameter $\xi(G^{\infty})$, and the
discrepancy is rather large for the test problems with rapidly
varying coefficients, that is, problems 13 to 15 of reference 1.
These disappointing results, however, are not surprising
because of the following considerations: (1) the technique of
local relaxation, obviously, is less viable in the case of the
two-step procedure, and (2) the one-step method convergence
rates associated with these test problems are all relatively high,
and thus the advantage of using the two-step procedure is
greatly reduced.

TABLE II. VALUES OF $\xi(G^{\infty})$ AND $\xi_r(n)$ FOR PROBLEMS 1 TO 17
OF REFERENCE 1

[For problems 1 to 5, n = 10; for problems 6 to 15, n = 16; and for problems 16
and 17, n = 5.]

Problem   ξ(G^∞)   ξ_r(n)
1         1.234    1.077
2         1.237    1.133
3         1.228    1.144
4         1.234    0.904
5         1.236    1.024
6         1.402    0.962
7         1.403    1.068
8         1.394    0.730
9         1.403    0.977
10        1.403    0.897
11        1.402    0.804
12        1.289    0.809
13        1.289    0.521
14        1.289    0.347
15        1.289    0.169
16        1.336    0.793
17        1.367    1.216

To demonstrate that the two-step method could be
substantially faster than the one-step method for the test
problems with low one-step method convergence rate,
problems 9 and 10 are introduced, which are modified versions
of problems 16 and 17 of reference 1. The modification
involves only the enlargement of the domain of equation (1-69)
from $1 > x > 0$ and $1 > y > 0$ to $1.5 > x > 0$ and $1.5 > y > 0$. A
comparison between table II of reference 1 and table III of
this report reveals that this simple modification results in a
large reduction in the values of $O_r(10)$ and thus the one-step
method convergence rates. Furthermore, it can be seen that
the two-step method, as indicated by the values of $\xi_r(5)$ and
$\xi_r(10)$, is indeed substantially faster than the one-step method
for problems 9 and 10.

Problems 18 and 19 defined in reference 1 are associated 
with a self-adjoint PDE (1-70) in which the coefficients p and 
q are identical. As noted in the section Local Relaxation, for 
these test problems the two-step method is inferior to the one- 
step method. 

The next test problems to be discussed are those defined
in table III of reference 1. If the values of $c_0/a_0$ used in the
iterations are chosen according to equation (1-54), the
computational efficiencies of the one-step and two-step
methods are about equal, as shown in table IV. It is also shown
that for the test problem associated with equation (1-74) the
convergence rates are sharply reduced if $c_0/a_0 = 1$ is assumed
in the iterations. In this case the computational efficiency of
the two-step method is about twice that of the one-step method.

Finally, the one- and two-step methods are compared by
using the three-dimensional test problem defined in the section
Application to a 3-D Flow Problem of reference 1. As in
reference 1 it is assumed that $p_1 = p_2 = 1$ and $p_3 = 0.4135$.
The values of $\xi_r(4)$ obtained during the 25 passes through the
inner loop range from 1.140 to 1.308, with the average being
1.256.

TABLE III. VALUES OF KEY PARAMETERS FOR
PROBLEMS 9 AND 10

Problem   c_0/a_0    O_r(10)   ξ(G^∞)   ξ_r(5)   ξ_r(10)
9         0.1239 a   1.083     1.654    2.009    1.840
10        0.1065 a   0.928     1.698    1.741    1.745

a Evaluated from eq. (1-54).


TABLE IV. VALUES OF KEY PARAMETERS FOR TEST PROBLEMS DEFINED
IN TABLE III OF REFERENCE 1

Equation   Solution method   c_0/a_0    T, sec
(1-73)     One-step          0.8839 a   1.871
(1-73)     Two-step          0.8839 a   1.531
(1-74)     One-step          9.989 a    0.888
(1-74)     Two-step          9.989 a    1.050
(1-74)     One-step          1.0        14.04
(1-74)     Two-step          1.0        7.61

a Evaluated from eq. (1-54).

Concluding Remarks 

A two-step semidirect procedure was developed to accelerate
the one-step procedure described in reference 1. The
acceleration may be substantial for elliptic problems which
have low convergence rates with the one-step procedure.

A key element in the development of the two-step procedure
is to choose the coefficients $a'$, $b'$, and $c'$ such that the
convergence rate (in a sense defined in the section Analysis)
can be maximized. This optimization problem is solved by
equation (29). With the coefficients $a'$, $b'$, and $c'$ so chosen
a simple monotonic relation exists between the asymptotic
error multiplication factors for the one- and two-step
procedures. This relation not only enables us to establish the
superiority of the two-step procedure compared with the
one-step procedure, but also contributes greatly to the simplicity
of the two-step procedure.


Finally, it should be emphasized that the development of 
the current one- and two-step procedures involves many bold 
assumptions. In future development it is hoped that the validity 
of these assumptions may be evaluated by using a rigorous 
matrix formulation. It is also hoped that the limitations imposed 
by these assumptions can be at least partially removed. 


Lewis Research Center 

National Aeronautics and Space Administration 
Cleveland, Ohio, February 6, 1986 
Appendix - Mathematical Foundation for the section Analysis


The existence of the supremum $\bar{\bar E}^{\infty}$ defined in equation
(25) is shown here. With the aid of equations (I-B18) and (18),
equations (15), (21), and (24) imply that

$$\bar{\bar E} \le \frac{\lambda_{\max}\,\lambda'_{\max}}{\lambda_{\min}\,\lambda'_{\min}}$$   (A1)

Since the upper bound of $\bar{\bar E}$ given in equation (A1) is
independent of the integers $K$ and $L$, the existence of $\bar{\bar E}^{\infty}$ is
established.

To prove that (1) $\bar{\bar E}^{\infty} \ge \bar{\bar E}^{*}$ and (2) $\bar{\bar E}^{\infty} = \bar{\bar E}^{*}$ if and only
if eq. (27) is satisfied, the following lemmas are established:


Lemma 1

Let

$$H(v_1, v_2) \overset{\text{def}}{=} [\tilde a (v_1)^2 + \tilde c (v_2)^2 + 2\tilde b v_1 v_2]\,[\tilde a' (v_1)^2 + \tilde c' (v_2)^2 + 2\tilde b' v_1 v_2]$$   (A2)

where $v_1$ and $v_2$ are real variables satisfying

$$(v_1)^2 + (v_2)^2 = 1$$   (A3)

Then the supremum $H_{\max}$ and the infimum $H_{\min}$ of the
function $H$ over its domain exist. Furthermore, $H_{\max} \ge H_{\min} > 0$,
and there exist real numbers $v_1^{+}$, $v_2^{+}$, $v_1^{-}$, and $v_2^{-}$ such that

$$(v_1^{+})^2 + (v_2^{+})^2 = (v_1^{-})^2 + (v_2^{-})^2 = 1$$   (A4)

$$H(v_1^{+}, v_2^{+}) = H_{\max} > 0$$   (A5)

and

$$H(v_1^{-}, v_2^{-}) = H_{\min} > 0$$   (A6)

Proof

Since $H$ is continuous over a compact domain, $H_{\max}$, $H_{\min}$,
$v_1^{+}$, $v_2^{+}$, $v_1^{-}$, and $v_2^{-}$ must exist. It follows from equations
(1-4) and (16) that $H_{\max} \ge H_{\min} > 0$.


Lemma 2

$$\bar{\bar E}^{\infty} \ge \frac{H_{\max}}{H_{\min}}$$   (A7)

Proof

Let

$$F(s_1, s_2, t_1, t_2) \overset{\text{def}}{=} [\tilde a (s_1)^2 + \tilde c (s_2)^2 + 2\tilde b s_1 s_2 t_1 t_2]\,[\tilde a' (s_1)^2 + \tilde c' (s_2)^2 + 2\tilde b' s_1 s_2 t_1 t_2]$$   (A8)

where $s_1$, $s_2$, $t_1$, and $t_2$ are real variables satisfying equations
(I-B5) and (I-B7). Using equations (11), (12), (15), and
(I-B13) to (I-B17), one concludes that

$$\bar{\bar\gamma}^{(k,\ell)} = F(s_x, s_y, t_x, t_y) \qquad (k,\ell) \in \Psi$$   (A9)

Furthermore, a comparison between equations (A2) and (A8)
reveals that

$$H(v_1, v_2) = F\left(|v_1|, |v_2|, \frac{v_1}{|v_1|}, \frac{v_2}{|v_2|}\right) \qquad \text{if } v_1 \ne 0 \text{ and } v_2 \ne 0$$   (A10)

Let $v_1$ and $v_2$ be two given real numbers satisfying (1)
$(v_1)^2 + (v_2)^2 = 1$, and (2) $v_1 \ne 0$ and $v_2 \ne 0$. Following a
line of arguments (eqs. (I-B21) to (I-B31)), which was used
in reference 1 to establish the existence of the integers $K_0$ and
$L_0$, one can show that for any $\delta_x > 0$, $\delta_y > 0$, $\epsilon_x > 0$, and
$\epsilon_y > 0$ there exists a pair of integers $K_0$ and $L_0$ such that for
any $K \ge K_0$ and $L \ge L_0$ two integers $k$ and $\ell$ can be found to
satisfy the requirements

$$\delta_x > \big|\,s_x - |v_1|\,\big|, \qquad \delta_y > \big|\,s_y - |v_2|\,\big|, \qquad \epsilon_x > \left|\,t_x - \frac{v_1}{|v_1|}\right|, \qquad \epsilon_y > \left|\,t_y - \frac{v_2}{|v_2|}\right|$$

Thus, in the case having $v_1^{+} \ne 0$, $v_2^{+} \ne 0$, $v_1^{-} \ne 0$, and
$v_2^{-} \ne 0$ equations (A9) and (A10) along with the continuity
of the function $F$ can be used to show that for any $\epsilon > 0$ there
exists a pair of integers $K$ and $L$ large enough that $(k,\ell) \in \Psi$
and $(k',\ell') \in \Psi$ can be found to satisfy the requirement

$$\epsilon > \left|\frac{\bar{\bar\gamma}^{(k,\ell)}}{\bar{\bar\gamma}^{(k',\ell')}} - \frac{H(v_1^{+}, v_2^{+})}{H(v_1^{-}, v_2^{-})}\right|$$

With the aid of equations (24) and (25), one concludes that
inequality (A7) is true if $v_1^{+} \ne 0$, $v_2^{+} \ne 0$, $v_1^{-} \ne 0$, and $v_2^{-} \ne 0$.
Since $H(0, \pm 1) = \tilde c\tilde c' = \bar{\bar\gamma}^{(0,\ell)}$ and $H(\pm 1, 0) = \tilde a\tilde a' = \bar{\bar\gamma}^{(k,0)}$,
inequality (A7) is valid for all $v_1^{+}$, $v_2^{+}$, $v_1^{-}$, and $v_2^{-}$ which
satisfy equations (A4) to (A6). QED.


Lemma 3

Let

$$f(\theta) \overset{\text{def}}{=} [\lambda_1\theta + \lambda_2(1 - \theta)]\,[\lambda_1'\theta + \lambda_2'(1 - \theta)] > 0 \qquad 1 \ge \theta \ge 0$$   (A11)

where $\lambda_1$, $\lambda_2$, $\lambda_1'$, and $\lambda_2'$ are given positive numbers. Then

(1) The supremum $f_{\max}$ and the infimum $f_{\min}$ of the function
$f$ over its domain exist, and $f_{\max} \ge f_{\min} > 0$.

(2) $$\frac{f_{\max}}{f_{\min}} \ge \frac{f(1/2)}{\text{Min}\,\{f(0), f(1)\}} \ge \frac{(\lambda_1 + \lambda_2)^2}{4\lambda_1\lambda_2}$$

(3) $$\frac{f_{\max}}{f_{\min}} = \frac{(\lambda_1 + \lambda_2)^2}{4\lambda_1\lambda_2} \quad \text{if and only if} \quad \lambda_1\lambda_1' = \lambda_2\lambda_2'$$

Proof

Part (1) is obvious. To facilitate the proof of (2) and (3),
three exclusive cases are studied separately.

Case 1. $\lambda_1\lambda_1' = \lambda_2\lambda_2'$. For this case

$$f(\theta) = \lambda_2\lambda_2'\,[\lambda_1\theta + \lambda_2(1 - \theta)]\left[\frac{\theta}{\lambda_1} + \frac{1 - \theta}{\lambda_2}\right]$$

and thus

$$\frac{(\lambda_1 + \lambda_2)^2}{4\lambda_1\lambda_2}\,\lambda_2\lambda_2' = f(1/2) \ge f(\theta) \ge f(0) = f(1) = \lambda_2\lambda_2' \qquad 1 \ge \theta \ge 0$$

As a result

$$\frac{f_{\max}}{f_{\min}} = \frac{f(1/2)}{f(0)} = \frac{f(1/2)}{f(1)} = \frac{(\lambda_1 + \lambda_2)^2}{4\lambda_1\lambda_2}$$

Case 2. $\lambda_1\lambda_1' > \lambda_2\lambda_2'$. For this case $\text{Min}\,\{f(0), f(1)\} = f(0)$,
and

$$\frac{f(1/2)}{f(0)} - \frac{(\lambda_1 + \lambda_2)^2}{4\lambda_1\lambda_2} = \frac{(\lambda_1 + \lambda_2)(\lambda_1\lambda_1' - \lambda_2\lambda_2')}{4\lambda_1\lambda_2\lambda_2'} > 0$$

Case 3. $\lambda_2\lambda_2' > \lambda_1\lambda_1'$. For this case $\text{Min}\,\{f(0), f(1)\} = f(1)$,
and

$$\frac{f(1/2)}{f(1)} - \frac{(\lambda_1 + \lambda_2)^2}{4\lambda_1\lambda_2} = \frac{(\lambda_1 + \lambda_2)(\lambda_2\lambda_2' - \lambda_1\lambda_1')}{4\lambda_1\lambda_2\lambda_1'} > 0$$

QED


Lemma 4

$$\frac{H_{\max}}{H_{\min}} \ge \bar{\bar E}^{*}$$

with the understanding that the equality sign is valid if and
only if equation (27) is satisfied.

Proof

To proceed, the first quadratic form on the right side of
equation (A2) is diagonalized (ref. 3) by using an orthogonal
transformation

$$(v_1, v_2) \to (\bar v_1, \bar v_2)$$

Thus,

$$(\bar v_1)^2 + (\bar v_2)^2 = (v_1)^2 + (v_2)^2 = 1$$   (A12)

and

$$H(v_1, v_2) = \bar H(\bar v_1, \bar v_2) \overset{\text{def}}{=} [\lambda_{\max}(\bar v_1)^2 + \lambda_{\min}(\bar v_2)^2] \times [\bar a'(\bar v_1)^2 + \bar c'(\bar v_2)^2 + 2\bar b'\,\bar v_1\bar v_2]$$   (A13)

where $\lambda_{\max}$ and $\lambda_{\min}$ are defined in equations (1-30) and
(1-31). Let

$$\bar D \overset{\text{def}}{=} \begin{pmatrix} \lambda_{\max} & 0 \\ 0 & \lambda_{\min} \end{pmatrix}, \qquad \bar D' \overset{\text{def}}{=} \begin{pmatrix} \bar a' & \bar b' \\ \bar b' & \bar c' \end{pmatrix}$$   (A14)

There exists an orthogonal matrix $U$ such that

$$\bar D = U D U^{-1} \quad \text{and} \quad \bar D' = U D' U^{-1}$$   (A15)

where $D$ and $D'$ are the matrices defined in equation (28).
As a result the matrix $\bar D'$, like $D'$, is positive definite; that
is,

$$\bar a' > 0, \qquad \bar c' > 0, \qquad \bar a'\bar c' - (\bar b')^2 > 0$$   (A16)

Let

$$f(\theta) \overset{\text{def}}{=} [\lambda_{\max}\theta + \lambda_{\min}(1 - \theta)]\,[\bar a'\theta + \bar c'(1 - \theta)] \qquad 1 \ge \theta \ge 0$$   (A17)

Then equations (A12) and (A13) imply that

$$\bar H(\bar v_1, \bar v_2) = f((\bar v_1)^2) + 2\bar b'\,\bar v_1\bar v_2 \times [\lambda_{\max}(\bar v_1)^2 + \lambda_{\min}(\bar v_2)^2]$$   (A18)

To proceed further, two exclusive cases, (1) $\bar b' = 0$ and (2)
$\bar b' \ne 0$, are studied separately.

Case 1. $\bar b' = 0$. For this case $\bar H(\bar v_1, \bar v_2) = f((\bar v_1)^2)$.
With the aid of equations (A5), (A6), (A12) to (A16), (26),
and (1-33) Lemma 3 implies that

$$\frac{H_{\max}}{H_{\min}} \ge \bar{\bar E}^{*}$$

with the understanding that the equality sign is valid if and
only if equation (27) is satisfied.

Case 2. $\bar b' \ne 0$. For this case, equation (27) is violated
since $\bar D\bar D'$, and thus $DD'$, cannot be a scalar matrix.
Furthermore,

$$\bar H\left(\frac{1}{\sqrt 2}, \frac{\bar b'}{\sqrt 2\,|\bar b'|}\right) = f(1/2) + \frac{|\bar b'|}{2}(\lambda_{\max} + \lambda_{\min}) > f(1/2)$$

$$\bar H(0, 1) = f(0)$$

$$\bar H(1, 0) = f(1)$$

Thus, with the aid of Lemma 3 one concludes that

$$\frac{H_{\max}}{H_{\min}} > \frac{f(1/2)}{\text{Min}\,\{f(0), f(1)\}} \ge \bar{\bar E}^{*}$$

QED


Lemma 5

Let $s_1$, $s_2$, $t_1$, and $t_2$ be real variables satisfying
equations (I-B5) and (I-B7). Then

$$\Lambda\bar{\bar E}^{*} \ge F(s_1, s_2, t_1, t_2) \ge \Lambda$$

if the matrices $D$ and $D'$ are related by equation (27).
Lemma 5 is a special case of theorem 2 in reference 4.

Lemmas 2 and 4 imply that (1) $\bar{\bar E}^{\infty} \ge \bar{\bar E}^{*}$ and (2)
$\bar{\bar E}^{\infty} > \bar{\bar E}^{*}$ if the condition (27) is violated. On the other hand,
if condition (27) is satisfied, Lemma 5 combined with
equations (A9), (24), and (25) implies that $\bar{\bar E}^{*} \ge \bar{\bar E}^{\infty}$. Thus,
one concludes that $\bar{\bar E}^{*} = \bar{\bar E}^{\infty}$ if and only if condition (27) is
satisfied.

Finally, expressions (30) and (31) are shown as follows: If
the matrices $D$ and $D'$ are related by equation (27) with $\Lambda = 1$,
Lemmas 4 and 5 coupled with equations (A9) and (A10) imply
that

$$\bar{\bar E}^{*} = H_{\max} \ge \bar{\bar\gamma}^{(k,\ell)} \ge H_{\min} = 1 \qquad (k,\ell) \in \Psi$$   (A19)

(Eq. (A10) may be replaced by $H(v_1, v_2) = F(v_1, v_2, 0, 0)$ if
either $v_1 = 0$ or $v_2 = 0$.) Expression (30) follows directly
from equation (A19). Furthermore, with the aid of expression
(A19), equation (31) can be shown by using a line of arguments
similar to that presented in the proof of Lemma 2.
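
The bound of Lemma 3 can be checked numerically with a small sketch. It samples $f(\theta)$ of equation (A11) on a uniform grid that contains $\theta = 0$, $1/2$, and $1$; the function names and the sample values of $\lambda_1$, $\lambda_2$, $\lambda_1'$, $\lambda_2'$ are hypothetical, and a grid search is only an approximation of the supremum and infimum.

```python
import numpy as np

def lemma3_ratio(l1, l2, l1p, l2p, n=2001):
    """Approximate f_max / f_min for f of equation (A11) on a uniform grid."""
    theta = np.linspace(0.0, 1.0, n)   # odd n, so theta = 1/2 is a grid point
    f = (l1 * theta + l2 * (1.0 - theta)) * (l1p * theta + l2p * (1.0 - theta))
    return float(f.max() / f.min())

def lemma3_bound(l1, l2):
    """Lower bound (l1 + l2)^2 / (4 l1 l2) of Lemma 3."""
    return (l1 + l2) ** 2 / (4.0 * l1 * l2)

equal_case = lemma3_ratio(3.0, 1.0, 1.0, 3.0)     # l1 * l1' == l2 * l2'
unequal_case = lemma3_ratio(3.0, 1.0, 2.0, 3.0)   # l1 * l1' >  l2 * l2'
```

In the first call the products $\lambda_1\lambda_1'$ and $\lambda_2\lambda_2'$ coincide and the ratio meets the bound; in the second they differ and the ratio exceeds it, in line with parts (2) and (3) of the lemma.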